Persistence - Based Data Analysis and Classification
نویسنده
چکیده
The main goals of the project are the formalization and the development of a novel approach to the analysis, comparison and classification of multidimensional data, i.e. information characterized by features embedded in some multidimensional topological space. Our proposed project will be based on a topological exploration and understanding of data, finding its roots in the so-called (multidimensional) persistence, a theoretical and computational methodology recently emerged as a suitable tool for such purposes. The framework developed within the project does not aim to replace existing data analysis tools. It will provide a new tool enabling the use of additional information, hence improving the existing frameworks, making them more robust and stable. One further contribution will be the finding of important data properties with regard to the multidimensional persistence framework.
منابع مشابه
Statistical Topology Using the Nonparametric Density Estimation and Bootstrap Algorithm
This paper presents approximate confidence intervals for each function of parameters in a Banach space based on a bootstrap algorithm. We apply kernel density approach to estimate the persistence landscape. In addition, we evaluate the quality distribution function estimator of random variables using integrated mean square error (IMSE). The results of simulation studies show a significant impro...
متن کاملImprovement of coal mine roof rating classification using fuzzy type-2
One of the main concerns of an underground coal mining engineer is the safety and stability of the mine. One way that the safety and stability can be ensured is to know and understand the coal mine geology and how it reacts to the mining process. One technique that has shown a lot of success in the coal mining industry for geologic technical evaluation purposes is the coal mine roof rating (CMR...
متن کاملFuzzy Data Envelopment Analysis for Classification of Streaming Data
The classification of fuzzy uncertain data is considered one of the most challenging issues in data analysis. In spite of the significance of fuzzy data in mathematical programming, the development of the analytical methods of fuzzy data is slow. Therefore, the current study proposes a new fuzzy data classification method based on fuzzy data envelopment analysis (DEA) which can handle strea...
متن کاملFuzzy Data Envelopment Analysis for Classification of Streaming Data
The classification of fuzzy uncertain data is considered one of the most challenging issues in data analysis. In spite of the significance of fuzzy data in mathematical programming, the development of the analytical methods of fuzzy data is slow. Therefore, the current study proposes a new fuzzy data classification method based on fuzzy data envelopment analysis (DEA) which can handle strea...
متن کاملA PHENETIC ANALYSIS OF THE SPECIES OF IRANIAN OXYTROPIS (FABACEAE) BASED ON MORPHOLOGICAL DATA
A phenetic analysis is presented using SAHN clustering (UPGMA, simple matching coefficient). The resulting phenetic analysis is compared with the classification of Oxytropis as suggested by Bunge [2] and Vassilczenko [ll]. The results partly correspond with Bungean classification and suggest that there are basically four sections (Mesogaea, Oxytropis, Janthina, Eumorphu) which can be group...
متن کامل